A Bayesian Discretizer for Real-Valued Attributes

نویسنده

  • Xindong Wu
چکیده

Discretization of real-valued attributes into nominal intervals has been an important area for symbolic induction systems because many real world classiication tasks involve both symbolic and numerical attributes. Among various supervised and unsupervised discretization methods, the information gain based methods have been widely used and cited. This paper designs a new discretization method, called the Bayesian discretizer, and compares its performance with some other methods including the information gain methods implemented in C4.5 and HCV (Version 2.0). Over the 7 tested datasets, the Bayesian dis-cretizer has the best results for 3 in terms of predictive accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Discretizer for Knowledge Discovery Approaches Based on Rough Sets

Knowledge discovery approaches based on rough sets have successful application in machine learning and data mining. As these approaches are good at dealing with discrete values, a discretizer is required when the approaches are applied to continuous attributes. In this paper, a novel adaptive discretizer based on a statistical distribution index is proposed to preprocess continuous valued attri...

متن کامل

A Bayesian Neural Network Model with Extensions a Bayesian Neural Network Model with Extensions

This report deals with a Bayesian neural network in a classiier context. In our network model, the units represent stochastic events, and the state of the units are related to the probability of these events. The basic Bayesian model is a one-layer neural network, which calculates the posterior probabilities of events, given some observed, independent events. The formulas underlying this networ...

متن کامل

The Use of a Bayesian Neural Network Model for Classification Tasks

This thesis deals with a Bayesian neural network model. The focus is on how to use the model for automatic classification, i.e. on how to train the neural network to classify objects from some domain, given a database of labeled examples from the domain. The original Bayesian neural network is a onelayer network implementing a naive Bayesian classifier. It is based on the assumption that differ...

متن کامل

Discretization of Continuous-valued Attributes and Instance-based Learning

Recent work on discretization of continuous-valued attributes in learning decision trees has produced some positive results. This paper adopts the idea of discretization of continuous-valued attributes and applies it to instance-based learning (Aha, 1990; Aha, Kibler & Albert, 1991). Our experiments have shown that instance-based learning (IBL) usually performs well in continuous-valued attribu...

متن کامل

The ring of real-valued functions on a frame

In this paper, we define and study the notion of the real-valued functions on a frame $L$. We show that $F(L) $, consisting of all frame homomorphisms from the power set of $mathbb{R}$ to a frame $ L$, is an $f$-ring, as a generalization of all functions from a set $X$ into $mathbb R$. Also, we show that $F(L) $ is isomorphic to a sub-$f$-ring of $mathcal{R}(L)$, the ring of real-valued continu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Comput. J.

دوره 39  شماره 

صفحات  -

تاریخ انتشار 1996